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Quantum receiver is an important tool for overcoming the standard quantum limit (SQL) of 
discrimination errors in optical communication. We theoretically study the quantum receivers for 
discriminating ternary and quaternary phase shift keyed coherent states in terms of average error 
rate and mutual information. Our receiver consists of on/off- type photon detectors and displacement 
operations w/o electrical feedforward operations. We show that for the ternary signals, the receiver 
shows a reasonable gain from the SQL even without feedforward. This scheme is realizable with 
the currently available technology. For the quaternary signals feedforward operation is crucial to 
overcome the SQL with imperfect devices. We also analytically examine the asymptotic limit of the 
performance of the proposed receiver with respect to the number of feedforward steps. 



PACS numbers: 03.67.Hk, 03.67.-a 

I. INTRODUCTION 

Coherent states are not orthogonal to each other and 
then they can not be discriminated without errors. How- 
ever, coherent states have special importance for com- 
munications since they are the best signal carriers. In 
fact, in most practical optical channels where energy loss 
is linear, they can propagate intact in pure states. This 
characteristic is clearly emphasized in the theory on the 
ultimate capacity of a lossy bosonic channel [1], which 
proves that the optimal encoding scheme, to attain the 
ultimate capacity under power constraint, has to employ 
a sequence of coherent-state pulses to code the informa- 
tion messages. Hence, the use of non-classical states at 
the transmitter does not increase the channel capacity. 
Quantum effects are required at the receiver since op- 
timal decoding generally calls for entangling operations 
over a sequence of coherent states [2-6]. The concept 
of 'quantum collective decoding' was first demonstrated 
in [7] where polarization-location coding in a single pho- 
ton was adopted. Recently codeword demodulation for 
coherent states without entangling operation was also 
demonstrated [8] based on conditional pulse nulling [9]. 
However, there are still technical challenges to realize a 
quantum collective decoder for coherent states. 

An important step towards this goal consists in realiz- 
ing a quantum optimal receiver that is not collective but 
that can discriminate each single coherent state with min- 
imum error probability. The discrimination error in con- 
ventional receivers, homodyne and heterodyne receivers, 
is bounded by the shot noise limit, which is often referred 
to as the standard quantum limit (SQL) in coherent op- 
tical communication. On the other hand, Helstrom pro- 
vided a theory to find the ultimate lower bound to the 
error probability [10]. The Helstrom bound results to be 
exponentially lower than the SQL and thus many efforts 
have been devoted to explore how to design practical re- 
ceivers able to approach such a limit. 



For binary signals, it was shown that the SQL can not 
be outperformed by Gaussian operations (up to the sec- 
ond order optical nonlinear processes) with any classical 
conditional dynamics [11], while the Helstrom bound is 
attainable if higher order nonlinearities are freely avail- 
able [12]. One of the currently feasible ways to realize 
nonlinearities relies on using highly efficient photon coun- 
ters. Dolinar proposed an optimal receiver composed by 
a displacement operation, a photon counter and feed- 
back [13]; its performance was demonstrated for the dis- 
crimination of on-off keying signals [14]. Furthermore, 
sub-optimal receivers without feedback consisting only 
of photon counting and optical displacement have been 
also proposed and experimentally demonstrated [11, 15- 
17]. The advantage of adopting such a simpler setup is 
that it easily allows to use a highly efficient photon de- 
tector such as a transition-edge sensor (TES) [19, 20]. In 
[21] it was demonstrated that without correcting any im- 
perfection, with such a simpler architecture it is possible 
to outperform the SQL (the homodyne limit). 

Respect to the binary case, much less attention has 
been paid to the discrimination of M-ary signals with 
M > 2. Bondurant extended the Dolinar receiver to 
quaternary phase shift keying (4PSK) signals, and he 
proposed a sub-optimal receiver consisting of continu- 
ous photon counting and infinitely fast electrical feedback 
[22]. Recently some simpler schemes have been proposed 
and experimentally tested. In [23] it was considered an 
hybrid scheme composed of homodyne and optimized dis- 
placement receivers with feedforward. In [24] displace- 
ment receivers and feedforward were employed and it was 
numerically showed that the proposed scheme is applica- 
ble for general M-ary PSK signals. Such a solution is 
particularly attractive since good performances can be 
achieved with a few number of feedforward stages and 
with moderated detection efficiency requirements. In the 
reported experiments, however, the dynamical feedfor- 
ward was not performed (but 'emulated' [24]), which in- 
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dicates that there are still technical difficulties to realize 
realtime feedforward in quantum receivers. In addition, 
in [24] it was left as a future task the discussion of the 
scalability of the proposed scheme with the number of 
feedforward steps TV. 

In this paper, we theoretically investigate the 
displacement-based receiver for ternary and quaternary 
PSK signals, i.e., 3PSK and 4PSK. Compared to previous 
works [22-24], our contribution includes the following ad- 
ditional aspects. First, we show that even with a simple 
setup without any feedforward or feedback, it is possible 
to overcome the SQL (the heterodyne limit). Although 
the novel receiver requires relatively high detection effi- 
ciencies for photon counting, its implementation is feasi- 
ble with state-of-art photon detectors, e.g. TES reported 
in [19, 20]. Second, we provide analytical expressions for 
the error rate performance of the displacement receiver 
with feedforward, which structure is basically similar to 
the setup given in [24]. We show that the adoption of 
feedforward operations drastically improve the error rate 
performance, and consequently tolerate the requirement 
for photon detectors, in agreement with the results in 
[24]. In addition the obtained analytical formula allows 
one to clarify the scalability of the performance in the 
limit of large TV. We also compare the performance with 
the Bondurant receiver [22] and the Helstrom limit. Fi- 
nally, we present an analysis based on the mutual infor- 
mation of the system including a comparison with the 
unambiguous state discrimination method. 

This paper is organized as follows. In Sect. II, we dis- 
cuss the displacement receiver without feedforward. The 
performance of the receivers with feedforward are ana- 
lyzed in Sect. III. Sect. IV is devoted to the mutual in- 
formation analysis and the paper is concluded in Sect. V. 



II. DISPLACEMENT RECEIVER WITHOUT 
FEEDFORWARD 

In this section we shall propose and describe the struc- 
ture of two receivers, which do not include any feedfor- 
ward operation, for the 3PSK and 4PSK signals. 

The M-ary PSK coherent states |a m ), m = 
0, 1, . . . , M — 1, are defined as 



\a u 



2iri/M 



(i) 



where, without loss of generality, a is chosen to be a real 
number. Throughout this paper, we assume that the a- 
priori probabilities of the signals are all the same, i.e. 
equal to 1/M. The states can be generated as 
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\a m )=V m \a ), ^ = exp(— j 



(2) 



where n represents the photon number operator. 

The displacement receiver consists of beam splitters, 
displacements, on/off detectors w/o feedforward. The 



beam splitter operation B(R) combines and splits two 
input coherent states \ j3) and I7) as 



B(R) | 7 > 



VRP + VI-R1 



(3) 



where its geometric configuration is illustrated in the in- 
set of Fig. 1. Displacement operation D(j) shifts the 
amplitude of coherent state as D(j)\f3) = \/3 + 7). It 
is well known that the displacement operation is imple- 
mented by combining the signal and a local oscillator 
via a highly transmissive beam splitter (for example, see 
[21])- 

On/off detector is a photon detection device observing 
only zero or non-zero photons. The on/off detector is 
described by a set of operators, 



Aoff = e-"£(l-»7) n |n)<r 

n=0 

lion = I — Iloff , 



(4) 
(5) 



where v is the dark count probability and 77 is the de- 
tection efficiency. The probability of finding an off-signal 
when detecting \a m ) is given by 



A. Ternary PSK signals: M = 3 



(6) 



The structure of the receiver for M = 3 is depicted 
in Fig. 1 . The basic operation principle follows the 
same idea lying behind the Kennedy's receiver [15], where 
BPSK signals are firstly displaced, such that one of the 
two signals becomes the vacuum state (signal nulling), 
and then they are discriminated by means of an on/off 
detector. For an ideal photon detector, the vacuum state 
is always determined with no error, while mis-detection 
may occur on the other state. For multiple PSK signals, 
we can extend the same basic principle by nulling con- 
stellation symbols. 

In Fig. 1, the optical signal is split into two branches A 
and B via a beam splitter having reflectance R. After the 
beam splitting, On branch A, the signal is displaced by 
Da(— VRao) while on branch B by Db(— Vl — Rol\). 
As a result, we can see the system as being made up of 
the composition of two separable states, leading to the 
following possible overall two-mode states 



l*o) AB = 

l*iU = 

l*2) AB = 



0) A <g> I VI - R(a - ai)> l 
VR(ai-ao)) ®|0) B 



s/R{a 2 - a o)) A ® I VI - R(a 2 - a 1 )) B 



(7) 

where, to simplify the description, we assume that the 
phase shift due to the beam splitter is compensated by a 
phase shifter. 
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FIG. 1: Displacement receiver with two-port detection struc- 
ture without feedforward operations for the 3PSK case. Inset 
represents the definition of the beam splitting operation for 
coherent states. See the text for details. 



By regarding the vacuum and non- vacuum states as the 
"on" and "off" signals, respectively, and by viewing the 
signals on branch A and B as couples, the above states 
can be referred to as \^o)ab ~ K on ^ on )> I^i)ab ~~ K on > 
off), and \^2)ab - K on ? on )- Then, recalling (4) and (5), 
a straightforward decision rule can be given through the 
definition of the following operators 
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where II3 represents the (off, off) case. 

The channel matrix P(j\i) = \ II j is then com- 
posed by the following elements 
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Following a maximum likelihood criterion, we can as- 
sociate to any outcome a symbol estimate d as follows: 
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FIG. 2: Average error rate for 3PSK signal discrimination 
without applying feedforward, (a) Equal beam splitting and 
exact nulling (blue line), optimized beam splitting and ex- 
act nulling (green line), and optimized beam splitting and 
displacements (red line). 77 = 1 and v — 0. (b) Error perfor- 
mance for different values of 77. v — 10 -6 . In both figures, 
the black dashed and dotted lines represent the Helstrom and 
the heterodyne limits, respectively. 

Hence, for R > 1/2, the average error rate is given by 

1 2 

Pe =1 - - ^2 P(& a m \a m ) 

6 m=0 (11) 

=1 " I W0|0) + P(3|0) + P(l|l) + P{2\2)} . 

We note that when "off" is the outcome on branch A, we 
decide for 0, regardless of the outcome on branch B. The 
analysis can also be easily turned to the case R < 1/2. 
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FIG. 3: Displacement receiver with three-port detection 
structure without feedforward operations for the 4PSK case. 



The error rates derived from Eq. (11) are plotted in 
Fig. 2(a). The blue and green lines are obtained, re- 
spectively, for fixed R = 1/2 and for numerically opti- 
mized R for any given value of a. The performance dif- 
ference between the two setups is small and, for signals 
with \a\ 2 > 2, the proposed receiver remarkably outper- 
forms the heterodyne limit. In the weak coherent state 
region, the receiver performance can be further improved 
by optimizing the amount of the displacements Da(') an d 
Db(') (i- e -7 n °t the exact nulling) as indicated by the red 
line. Displacement optimization was discussed in [11, 16- 
18, 21] for BPSK signals and in [9] for the pulse position 
modulation. We observe that an additional gain can be 
obtained in the weak signal region. In Fig. 2(b) we plot 
the error rate with optimized R and exact nulling assum- 
ing imperfect on/off detectors having v = 10 -6 and dif- 
ferent values for 77. We note that it is possible to outper- 
form the heterodyne limit even with moderate detection 
efficiency. For example, the TES developed in [19, 20] 
already reached r] = 90% and v — 10 -6 and thus the 
sub-SQL receiver could be realized with currently avail- 
able technology. 



B. Quaternary PSK signals: M = 4 

For the 4PSK signal we consider the three ports scheme 
depicted in Fig. 3. The input signal is split into three 
branches A, B, and C by means of two beam splitters 
with reflectance R\ and R 2l respectively. Based on the 
three outcomes, the optimal decision rule can be pursued 
by following a similar approach as for the 3PSK modula- 
tion. It turns out that, different strategies can be adopted 
depending on the value of a and selection of Ri and R 2 . 
By an analytical and numerical study we found that the 
following straightforward strategy can be employed with- 
out any noteworthy performance degradation. 



placed by Da(— y/Ri <^o) and it is detected by an on/off 
detector. If the result is "off" , then the symbol estimate 
is taken as a = ao, otherwise the results on the successive 
branches are considered. At this stage, the probability of 
correct decision for symbol ao results 



P(a = ao\ao) 



(12) 



If the result is "on" on branch 
charge the hypothesis of symbol ao. 

is displaced by 



A, we dis- 
On branch 



B, the signal 



y/(l-Rl)R 2 a* 



d b(~\/(1 ~ Ri)R2 ot 2 ). If the result is "off", then the 
estimate is taken as a = a 2l otherwise the result on the 
next branch is considered. The probability of correct de- 
cision for a 2l is given by the product of the probabilities 
of the events: having "on" on branch A and having "off" 
on branch B, that is 



P(a = a 2 \a 2 ) = (l - e —^Ria^ e -u 



(13) 



Next, if the result is "on" on branch B, then we at- 
tempt to distinguish between a\ and 0^3 on the last 

branch C. So the signal ^/(l — R\)(l — R 2 ) oim) is dis- 



placed by L>c{—\/{1 — R\){1 — R2) ol\), and if the out- 
come is "off" we decide a = ai, otherwise a = as. The 
probabilities of correct decision results 

P(a = a 1 \a 1 ) = (l - e -^vRi^ 

P(a = a 3 \a 3 ) = (l - e - v ~ 2r]R ^ 

x - e -»- 2r i(i-Ri)R2a 2 ^ 

X - e -^-^(l-^i)(l-^2)a 2 ^ ( 15 ) 

Therefore, the average error rate is given by 



1 3 

Pe = 1 - T ^ P (® a m\^r, 



(16) 



m=0 



Figure 4(a) reports the resulting error rates with equal 
beam splitting (Ri — 2/3, R 2 — 1/2, the blue line), 
and optimized R\ and R 2 w/o the optimization of the 
displacements (the green and red lines, respectively) in 
comparison with the heterodyne limit and the Helstrom 
bound. In contrast to the receiver for 3PSK signals, for 
the 4PSK case the optimization of the reflectances is cru- 
cial to provide better performance than the heterodyne 
limit, while the optimization of the displacements is less 
effective. The effect of detector imperfections are illus- 
trated in Fig. 4(b). We note that the requirement on 
detector efficiency is quite severe and the expected gain 
with respect to the heterodyne limit is not as remarkable 
as for the 3PSK case. 
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FIG. 4: Average error rate for 4PSK signal discrimination 
without applying feedforward, (a) Equal beam splitting and 
exact nulling (blue line), optimized beam splitting and ex- 
act nulling (green line), and optimized beam splitting and 
displacements (red line). 77 = 1 and v — 0. (b) Error perfor- 
mance for different values of 77. v = 10 -6 . In both figures, 
the black dashed and dotted lines represent the Helstrom and 
the heterodyne limits, respectively. 



III. DISPLACEMENT RECEIVER WITH 
FEEDFORWARD 



In this section, we discuss the displacement receiver 
employing feedforward operations. The schemes dis- 
cussed in Sect. II were composed of a fixed number M — l 
of branches dependent on the number of signals M . Here- 
inafter, we consider a generalization where the incom- 
ing signal is split into a generic number N > M — 1 of 
branches as shown in Fig. 5. 

The reflectance of the displacement at the nth branch, 
n = 1,2, . . . N-l, is fixed to R n = l/(iV-n + l), so that 
the signal intensity is the same on each branch. In other 
words, we obtain N copies of weaker state \a m /y/N) of 
the incoming signal \a m ) (this could also be done in the 
time domain if convenient). We also assume that the 
value of the displacement at the nth branch, D n (-), can 
be set once the outcome on the (n — l)th branch is ob- 




FIG. 5: The displacement receiver consisting of N- step feed- 
forward operations. 



tained. Final decision is performed considering the out- 
comes obtained on the TV different branches. 

In the following, we detail the detection strategy for 
3PSK and 4PSK signals. 



A. Ternary PSK signals: M = 3 

The use of feedforward operations open to refine the 
decision process. In fact, at the first step n = 1 we apply 
the same displacement Di(—ao/y/N) as for the schemes 
considered in the previous section, however, if an "off' 
signal is detected we do not definitely conclude that a = 
an, but we just keep applying the same displacement also 
on the successive step to further validate our decision. 

Fig. 6 depicts an example of the feedforward tree for 
N = 5 with signal input an. The probability of having 
an "off" outcome at the first step is 



Po 



(17) 



Hence, because of the dark counts, with probability 1— pn> 
the result "on" may occur and, erroneously, the re- 
ceiver try to discriminate between symbols m = 1 and 
m = 2. Therefore, in the second step, the displacement 
L>2(—ai/VN) is applied with the aim of testing hypoth- 
esis m = 1. Then, if an "off" is detected, we maintain 
the same displacement and further proceed to the third 
step; such an event occurs with probability 



Pi 



(18) 



Otherwise, if the detection returns an "on" signal, then 
we just erroneously decide for the signal a = a2- Similar 
operations are repeated up to the TVth step. 
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FIG. 6: Feedforward tree for the 3PSK case where the input 
is ao (rn — 0) and N = 5. 



The resulting decision rule is conveniently summarized 
as follows. When all the detectors output "off' , we decide 
for a = ao- If only one "on" is detected in the first 
N — 1 steps and an "off" is detected in the last step, then 
a = a\. If only one "on" signal is detected at the last 
step, the estimate is randomly made between a = ai and 
a = OL2- Finally, if at least two "on" signals are detected, 
then a = (%2. 

The probabilities of correct decision are then given by 



P(6t = ao |ao) 
P(a = ai \ai) 



Po •> 

N-2 



(19) 



5>i(1-Pi)pS 



N-l-t 



t=0 
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N-2 
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+pf (1- Pl )x-, 



(20) 



P(a = a 2 \a 2 ) = ^ p\ (1 



N-2-t 



t=l 
N-2 
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The average error rate is then equal to 



Pe = 1 - 77 ^2 a m\^m) • 

ra=0 



(21) 



(22) 



Assuming zero dark counts {y = 0) the above equations 



simplify as 

P(a \a ) = 1 , (23) 
P (aiK) = l_l (pf , (24) 

P(a 2 |a 2 ) = 1-1 [(2iV-l)pf- 1 -(2iV-3)pf] ,(25) 

and 
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In the limit of AT — oo, we obtain 
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-e~ 3r]a (2 + 377a 2 ) 
o 



(26) 



(27) 



The performance assuming ideal on/off detectors are 
shown in Fig. 7. The error rate noticeably decreases with 
the increasing of N. Most of the gain is achieved with 
just N = 5, and with N = 10 the performance gets very 
close to the asymptotical bound (27). For a sufficiently 
high signal intensity (such as a 2 ^> 2/(37?)) the bound 
(27), for 77 = 1, approximates as 



(28) 



For 3-PSK signal the Helstrom bound is given by [26] 
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where 



Ao 
Ai 
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with 



= exp(- 



k s exp ( 2 
and for large values of a 2 we find 



) COS (^§-OL 



-a 2 ) sin 



e,Hel 



(31) 



(32) 



Therefore, from the comparison between (28) and (32), 
we note that the asymptotical performance gap between 
the feedforward receiver and the Helstrom bound de- 
pends on the signal intensity a 2 . 

Fig. 8 points out the impact of imperfect detectors on 
the system error rate for different values of TV (77 = 90%, 
v = 10 -6 ). We observe that for large N the effect of 
the dark counts accumulate and seriously degrades the 
performance. For example, for a 2 > 10, the simpler 2- 
port scheme proposed in Sect. II attains better perfor- 
mance than the feedforward scheme. The dependence on 
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FIG. 7: Average error rates for 3PSK signal discrimination 
with iV-step feedforward operation with perfect detectors: 
v — and rj = 1. 
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FIG. 9: Average error rates for 3PSK signal discrimination 
with N- step feedforward operation for N = 10, v — 10 ~ 6 and 
different values of r\. 



the detector efficiency is illustrated in detail in Fig. 9 
for N = 10. The figure shows that the gain due to the 
feedforward could provide more tolerance to detector ef- 
ficiency, in agreement with the results in [24]. Finally, 
it should be noted that the optimization of the displace- 
ments also works for the feedforward receivers although 
the additional gain is small, see Appendix A. 
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FIG. 8: Average error rates for 3PSK signal discrimination 
with Af-step feedforward operation and imperfect detectors: 
v 10~ 6 and j] 0.9. 



B. Quaternary PSK signals: M — 4 

Similar arguments as in the previous section can be 
applied to the 4PSK state discrimination. In this case, 
at the first step, we have to define three different prob- 
abilities of getting an "off" signal after displacement 
Dti-ao/y/N) 



Po 
Pi 

P2 



2r^± 
N 

4rjcy. 2 
N 



(33) 
(34) 
(35) 



The probabilities of correct decision result 



P(0|0) 
P(l|l) 



Po j 

N-3 



(36) 



N-S-t 



N-2-t- 



t=0 s=0 
N-2 

+ Y2 p ui-pi)^- 2 -*(i-pi)x^ 



£=0 



+pf- 1 (l-p 1 )x 



AT-2 



P(2\2) = J^a-pa)^" 1 "* 



£=0 



+P*- 1 (1-P2)X± 
N-3 



N-3-t 



p(3|3) = y^ P *(i- Pl ) J2 rf(i-pi) 

s=0 

-3-t-s 

E P2(1"P 2 ) 

u=0 



£=0 

JV-3-t- 

X 

u=0 
7V-2 



£=0 

+ P r i (i- Pl )xi 



(37) 



(38) 



(39) 



To see the asymptotic behavior for N, let us fix v = 



8 



and then simplify the above equations as 
P(0|0) = 1, 

P(lll) 



1 

1+ 6 



(3N - 5)pf - 4pf - 1 
3(AT-l)pf- 2 "l 



P(2\2) 
P(3|3) 



1 



1-^(^ + 2^ 



N-l\ 



(9JV-ll)pf -(6JV 



1 

1+ 6 



^(iv-iX^-e^- 1 



(40) 

(41) 
(42) 

1 

(43) 



The error rates are plotted in Fig. 10 for ideal on/off 
detectors. The error rate is remarkably improved by in- 
creasing A/", especially up to N « 10. 
In the limit of N —> oo, we obtain 



poo 1 -4r7c 2 , 1 + 6r?a 2 2r?a 2 
^ e " 2 + 4 

which is further simplified for large a 2 and r] = 1 as 

poo 3 2 -2a 2 

i , ~ -a e 
2 

For 4-PSK signals the Helstrom bound reads 



(44) 



(45) 




(46) 



where 



A 
Ai 
A 2 
A 3 



2e _Q;2 (cosha 2 + cos a 2 ) , 
2e~ a (sinha 2 + sin a 2 ) , 
2e~ a (cosh a 2 — cos a 2 ) , 
2e~ a (sinha 2 — sin a 2 ) . 



(47) 



The bound (46) scales for large a 2 as 
p * -2a 2 



(48) 



which again implies that the difference between the feed- 
forward receiver and the Helstrom limit is related to the 
signal intensity through a multiplicative factor a 2 . 

It is also worth noticing that the asymptotic perfor- 
mance of our receiver given by Eq. (45) basically coin- 
cides with that of the Bondurant receiver [22]. The er- 
ror rate of the Bondurant receiver converges to P e ,Bon ~ 
a 2 e~ 2a for large a 2 which is the same as Eq. (45) except 
the lack of coefficient 3/2. The difference is due to the 
fact that the ordering of the pulse nulling is not the same 
(0 -> 1 2 in [22] while we choose 0-^2-^1). Though 
our ordering is not optimal in the asymptotic limit, we 
numerically found that our ordering shows lower error 
rates than that in [22] for small N and also for the re- 
ceiver without feedforward. 
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FIG. 10: Average error rates for 4PSK signal discrimina- 
tion with iV-step feedforward operation with perfect detec- 
tors: v — and rj = 1. 
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FIG. 11: Improved average error rates for 4PSK signal dis- 
crimination obtained by refining the feedforward rule with 
the maximization of a-posteriori probabilities. Ideal detec- 
tors: v — and rj = 1. 



The gap between the error rate of the feedforward re- 
ceiver and the Helstrom bound can be further reduced 
by refining the feedforward rule by adopting the maxi- 
mization of the a-posteriori probabilities as numerically 
demonstrated in [24]. We derive mathematical expres- 
sion for this scheme in the 4PSK case (see Appendix B) 
and report the error rate in Fig. 11 for comparison with 
Fig. 10. 

Figure 12 includes the effect of the detector imperfec- 
tions (rj = 0.9, v = 10 -6 ) into Fig. 10. In contrast to 
the 3PSK case, the scheme without any feedforward can- 
not beat the heterodyne limit for a 2 < 20. It strongly 
suggests that the feedforward would be essential to over- 
come the heterodyne limit in practice. For the dark count 
probability of v = 10 -6 , N = 5 ~ 10 would be a sensi- 
ble choice. Dependence on the detection efficiency is also 
highlighted in Fig. 13 for N = 10. 
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FIG. 12: Average error rates for 4PSK signal discrimination 
with N- step feedforward operation and imperfect detectors: 
v 1(T 6 and 77 0.9. 
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FIG. 13: Dependence on the detection efficiency of the 4PSK 
signal detection with feedforward. N — 10 and v — 10 -6 . 



IV. MUTUAL INFORMATION 

In this section we evaluate the mutual information at- 
tained by the proposed displacement receivers. 

The mutual information is related to the transmission 
efficiency of reliable communication when coding tech- 
niques are employed. Given the channel matrix of the 
transition probabilities [P(?/|x)] between input symbols 
{x} and output symbols {?/}, and the a-priori probabili- 
ties {P(x)}, the mutual information is given by [27-29] 



(49) 

Herein, {x} is the set of symbols m G {0, 1, . . . , M — 1}, 
conveyed by the M-ary coherent states {|a m )}, and {y} is 
the set of estimates m G {0, 1, . . . , M — 1}. The elements 
of the channel matrix are given by 



where {11^} is a set of detection operators. 

The functional meaning of the mutual information is 
as follows. Consider a block coding of length n to trans- 
mit information messages that can be represented by M k 
sequences of length k of symbols in {x}. Here we assume 
k < n. Hence, there are M n possible sequences among 
whose only M k sequences are selected as codewords to 
represent the information messages. There exist (n — k) 
redundant strings that are exploited for error correction. 
The amount of information conveyed by the codewords 
thus constructed is K = k log 2 M bits. The transmis- 
sion rate is then defined by R = K/n = (k/n)\og 2 M 
bits/letter. Now suppose that encoding is made under 
the constraint that the frequency of x's occurring in the 
set of codewords is P(x). Information theory proves [27- 
29] that by using an appropriate coding, one can trans- 
mit the information messages with an error probability 
as small as desired if condition R < I(X : Y) holds. 

The capacity is defined as the maximum mutual in- 
formation with respect to the prior distribution of the 
letters P(x) (for a memoryless channel) 



C = max I(X : Y) 

{P(x)} 



(51) 



In the present context, however, only the input variable 
X and the corresponding set of quantum states are given. 
The output variable Y is to be sought for the best quan- 
tum detection, which is described by the POVM (positive 
operator- valued measure) {n^}. So the capacity defini- 
tion can be formulated as 



C\ = max maxiYX : Y) . 

{P(x)}{u y } 



(52) 



For the ultimate capacity, denoted Coo, one should also 
consider collective decoding on blocks of symbols. Find- 
ing C\ and Coo for M-ary coherent states (M > 3) is a 
difficult task, and it still remains an open problem, as 
well as finding the maximum mutual information for a 
fixed P(x) 



Iacc 



max/(X : Y) , 
{n y } 



(53) 



P(y m\x = m) = (a x \ U y \a x ) , 



(50) 



which is called the accessible information for a given en- 
semble {\a x ) ,P(x)}. 

In the following we numerically evaluate the mutual 
information for the proposed displacement receivers and 
the unambiguous state discrimination [25, 30, 31]. The 
former can be implemented with currently available tech- 
nology, while, nowadays, the latter can be implemented 
in a form very close to the optimal solution [32]. In 
Fig. IV we compare, in the 3PSK case, the mutual in- 
formation attained by the simplest 2-port scheme with- 
out feedforward, the feedforward scheme with N = 3 and 
TV = 10, the unambiguous state discrimination (see Ap- 
pendix C), the heterodyne detection, and the Helstrom 
receiver. We observe that the USD outperforms the het- 
erodyne limit for a 2 > 0.7, but displacement receiver 
with feedforward is generally better. A similar behav- 
ior is observed for the 4PSK case reported in Fig. 14. 
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These conclusions are also in agreement with the results 
obtained for the binary signal case [33]. 
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FIG. 14: Mutual information for (a) the 3PSK receiver and 
(b) the 4PSK receiver. The receivers without feedforward 
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N — 4, 10 (purple and blue, respectively), the unambiguous 
state discrimination (red), the heterodyne detection (black 
dotted), and the Helstrom receiver (black dashed). On/off 
detectors are assumed to be ideal: v — and rj = 1. 



rate with respect to N and clarified the gap between our 
receiver and the Helstrom bound has the order of a 2 . 

The effect of feedforward also provide a remarkable 
gain with respect to the mutual information in particu- 
lar for a 2 < 5. While the USD also shows a good per- 
formance it is comparable with (a 2 > 2) or lower than 
(a 2 < 2) our receiver. Mutual information is the quan- 
tity which eventually determines the total performance 
of communication systems involving coding. It is an im- 
portant future direction to investigate the optimization 
of the system with respect to mutual information, such 
as the optimization of the prior probabilities or the in- 
vestigation of the better POVM consisting of M elements 
with M > M, as suggested by Davies for symmetric sig- 
nal sets [34]. 

Another interesting question is whether the feedfor- 
ward receiver presented here can be applied to more gen- 
eral purposes such as projecting qudit states. For the 
binary case, it is known that the setup discussed in this 
paper is universal in the sense that it can be used for 
arbitrary (destructive) two-dimensional projective mea- 
surement [35]. It is a future task to generalize it to the 
M-dimensional space, that is, to clarify which class of the 
M-dimensional projection measurement can be realized 
with the present receiver setup. 



Appendix A: Displacement optimization for the 
3PSK feedforward receiver 



In the 3PSK feedforward receiver introduced in 
Sect. Ill A, once a photon is detected at an (N— n)th step, 
n = 1, 2, . . . , AT — 1, the estimation hypothesis a = ao is 
discharged and the estimate has to be found between 
symbols a\ and «2. Consequently after a photon is de- 
tected at (N — n)th step, a binary discrimination can be 
performed in the remaining n steps. Hence, by using the 
approach in [17] (see also [18]), we fix the displacement 
of all the n remaining steps to an optimal value (3^ that 
can be found by solving the following transcendent equa- 
tion 



V. CONCLUDING REMARKS 

We theoretically and numerically analyzed the perfor- 
mance of the displacement receivers for the 3- and 4-PSK 
signals. We showed that it could be possible to overcome 
the SQL, i.e., the heterodyne limit, even without apply- 
ing feedforward operations. In particular, demonstration 
of the sub- SQL receiver for the 3PSK is quite feasible 
with the state-of-art photon detection technologies. We 
also showed that the error rate performance is drasti- 
cally increased even for moderate number of feedforward 
steps (N < 5). This means that the requirement for the 
detector specifications can be tolerated, which would be 
important for the 4PSK and agrees with the results in 
[24]. We also derived an asymptotic limit of the error 



^yfa = /3(™)tanh(v^fa/^ 



and we decide for a = a\ if no photons are detected at 
any of the remaining n steps, otherwise, if at least one 
photon is detected, we decide a = (%2. 
The probability of error results 



1 



-3r]a' 



N-l 



where 

~(n) 
Pi 



We note that by setting / 




,(A1) 



+3r]a 2 



~(n) 

P 2 = e 

} = ^r\/¥ a > Le -> h y p er - 
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forming full symbol nulling, Eq. (Al) becomes equal to 
Eq. (26). 

The comparison between Eq. (Al) and Eq. (26) re- 
veals that with this modification just a small additional 
gain can be obtained but only in the weak coherent state 
region a 2 < 2 . 



Appendix B: Optimization of the feedforward 
algorithm via the maximization of posteriori 
probabilities 

Here we describe the improved feedforward algorithm 
used in Fig. 11. In Sect. Ill (except Fig. 11), we consider 
the feedforward algorithm simply change the nulling sig- 
nal with the fixed ordering conditioned on the detector 
click (e.g. -> 2 -> 1 for the 4PSK). On the other hand, 
the algorithm described here dynamically optimizes this 
ordering with respect to the posteriori probabilities at 
each step. In the following, we consider only an ideal 
case, i.e. v = and 77 = 1. 

Suppose we start the detection process by nulling the 
m = signal at the first port, detect the 'off' outcome, 
nulling the m = signal again at port 2, and then obtain 
the 'on' result. Then the input signal is guessed to be 
one of m = 1, 2, 3 signals. More precisely their posteriori 
probabilities are given as 



N-l 



m = : P {) = p (l - p ) = 0, 



m = 1 and 3 : Pi = P3 = pi (1 — pi) , 



where 



2: P 2 =P2(1-P2), 



Po 



Pi 



P2 



_ 4c*l 

e N 



(Bl) 



(B2) 



(B3) 



(B4) 



(B5) 



(B6) 



These posteriori probabilities are compared to each other 
and the feedforward is performed such that the signal 
with the largest posteriori probability is nulled at the 
next port (if more than one signals are equally the largest, 
random guess is applied). Note that such magnitude 
comparison is not straightforward as it depends on the 
signal power a 2 and the number of port TV. 

After tracing all the possible feedforward scenarios, we 
find that the success probabilities of detecting each signal 
are expressed as 



P(l\l)=J2d(* 2 ,N)p s 1 (l-p 1 )p»- 1 -° 

S=l 

N-2 N-2-s 

+ 5>(a 2 ,7V)6(7VK(l-Pi) £ rf(l-Pi)p£ 



N-2-s-k 



s=0 



k=0 



(B8) 



N-l 



N-2-s-k 



P(2|2) = £ e(a 2 ,N)a(N)pf,(l-p 2 )p»- 1 

s=0 

^ N-2-s 

+ Y j d{a\N)pl(l-p 2 ) ^(l-Pi)Po 

s=l k=s+l 

N-3 

+ d{a\N)p s 2 {l-p 2 )Y^p k 1 {\ - Pl ) 

s=l k=0 
N-3-s-k 

x pi(i-pi) 

N-3 N-3-s 

+ d(a 2 ,N)p s 2 (l-p 2 ) £ pj(l-pi) 

s=z(N) k=0 
N-3-s-k 

E rf(l-Pi). (B9) 
P(3|3) = 

N-3 

Y j e{a\N)c{N)p{{l-p 1 ) 

s=0 

N-3-s N-3-s-k 

pi^-pi) P l 2^~P2) 

k=0 1=0 

N-3 

J2d(a 2 ,N)pt(l- Pl ) 

S=l 

-3-s N-3-s-k 

E P2 fe (i-p 2 ) E pia-pi) 

N-2 

+ E d(a 2 , N)pi(l - Pl ) E^(l " P2)P% 



S=l 

N-3-s 



N-2-s-k 



s=l 
N-2 



k=0 

N-2-s 



+ E d(a 2 ,N)p s 1 (l-p 1 ) £ p k 2 (l-P2) P £- 2 - s - k , 



s=y(N) 



k=0 



where a(N),b(N),c(N) and d(N) are 



a(N) = I J 



s > (N - 1) 
s < (N-l) 



P(0|0) = 1, 



(B7) 



b(N) 



1 s > (N - 2) 
s < (N - 2) 



(BIO) 



(Bll) 



(B12) 
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as 



c(N) 



s>(N-3) 
s < (N - 3) 



u 2 *n f 1 s > t(a 2 ,N) 



e(a 2 ,iV) 



{J 



2 v. J 1 S < t(a 2 ,N) 

s> t(a 2 ,N) ' 



(B13) 



(B14) 



(B15) 



and 



t(a\N)= - 2a2+N ff + ^\ (B16) 

Also 2: (TV) and y(N) are non- negative integers satisfying 
the conditions 

^<«<^«, (B17) 

and 

TV - 2 TV - 2 

-__ <2/ (7V) <__ + !. (B18) 

Note that we can derive such an analytical expression 
only for the model without imperfections. Because in 
an ideal model, the nulled signal is never be clicked 
which simplify the possible feedforward scenarios and 
make them tractable by hand. 



Appendix C: Unambiguous state discrimination 

For completeness, we here derive the POVM for an 
optimal USD of the symmetric signals. The discussions 
follow [31]. 

In order to describe the USD we introduce a basis set, 
which diagonalizes the generating operator V in Eq. (2), 



v = exp 

M-l 



u k \uj k ) (u k \ . 



k=0 



Then one can see 

M-l 



M-l 



p = ^ \ a m) ( a m\ = ^2 A m \0Jm) (uJ m \ , 



(CI) 



(C2) 



m=0 



m=0 



where the eigen values A m in Eq. (30) for the 3PSK and 
Eq. (47) for the QPSK. 

The success rate of the USD is given by 



Pusd = minA/e. 

k 

The detection operators are given by 

Am Jf P USD I C*m) ( a m\ 

for the signal state \a m ), using the reciprocal states 



± 1 ^ u mk 



\u k ) 



(C3) 
(C4) 

3 

(C5) 



where A = A fc 1 . They satisfy the orthogonality rela- 
tion 

(o4|ow) = y^5 m , m /. (C6) 



The operator for the inconclusive result is given by 

M-l 



(C7) 



m=0 



By using the POVM consisting of Eqs. (C4) and (C7), 
one can compute the mutual information for the optimal 
USD system. 
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